Identifying Student Behavior for Improving Online Course Performance with Machine Learning
نویسنده
چکیده
In this study we investigate the correlation between student behavior and performance in online courses. Based on the web logs and syllabus of a course, we extract features that characterize student behavior. Using machine learning algorithms, we build models to predict performance at end of the period. Furthermore, we identify important behavior and behavior combinations in the models. The result of prediction in three tasks reach 87% accurate on average without using any score related features in the first half of the semester. Figure 11 — Accuracy on 10 different periods in BEHP5000 by decision tree (with/without pruning) and random forest (with/without cross validation) ..... 30 Figure 12 — Average time spent on predicting performance on final exam by decision tree (with/without pruning) and random forest (with/without cross validation). Figure 13 — Accuracy on 10 different periods in BEHP5000 by decision tree (with/without pruning) and random forest (with/without cross validation) ..... 33 Figure 14 — Average time spent on predicting success and failure in the first attempt by decision tree (with/without pruning) and random forest Figure 15 — Accuracy on 10 different periods in BEHP5000 by decision tree (with/without pruning) and random forest (with/without cross validation) ..... 36 Figure 16 — Average time spent on predicting performance on final exam by decision tree (with/without pruning) and random forest (with/without cross validation). Table 8 — Vertical actual classes, lateral was predicted classes, the number of samples in that the upper left and lower right is hit the prediction, the number that the upper right and lower left is off the prediction, and give the true and false, respectively, positive, negative represents whether prediction is true and Table 9 — True positive and false positive rates on 10 different periods in BEHP5000 by decision tree (with/without pruning) and random forest Table 10 — True positive and false positive rates on 10 different periods in BEHP5000 by decision tree (with/without pruning) and random forest Table 11 — True positive and false positive rates on 10 different periods in BEHP5000 by decision tree (with/without pruning) and random forest Table 14 — Detected top 3 of the most important behavior combinations on 10 different periods in BEHP5000 by random forest with k-fold cross-validation in prediction of success and failure in the first attempt in the final test .......... 43 Table 15 — Detected top 3 of the most important behavior combinations on 10 different periods …
منابع مشابه
Identifying Student Behaviors Early in the Term for Improving Online Course Performance
In this study we investigate the correlation between student behavior and performance in an online course. We introduce highlevel behavior features derived from the course syllabus and sequential patterns. We propose a random forest algorithm with cross-validation to find correlation between behavior features and student performance. Considering a course with 10 periods, our empirical results i...
متن کاملPredicting Post-Test Performance from Online Student Behavior: A High School MOOC Case Study
With the success and proliferation of Massive Open Online Courses (MOOCs) for college curricula, there is demand for adapting this modern mode of education for high school courses. Online and open courses have the potential to fill a much needed gap in high school curricula, especially in fields such as computer science, where there is shortage of trained teachers nationwide. In this paper, we ...
متن کاملDoes Regular Online Testing Enhance Student Learning? Evidence from a Large First-year Quantitative Methods Course
In recent years, online tools have increasingly contributed to both course delivery and assessment across various domains with mixed quantitative results. We examine the introduction of a regular online assessment tool ('e-quiz') delivered by the WebCT Vista content management system to a large first-year quantitative methods course at UNSW, Sydney (Australia). Unique aspects of this study incl...
متن کاملStudent demographics and success in online learning environments
Understanding the role of student demographics in the online learning environment can assist institutions to make decisions regarding online programs. These decisions go beyond the initial question of whether to invest in online programs or whether to increase or decrease online course availability. This article discusses the relationship between student demographics and success in online learn...
متن کاملLearning Latent Engagement Patterns of Students in Online Courses
Maintaining and cultivating student engagement is critical for learning. Understanding factors affecting student engagement will help in designing better courses and improving student retention. The large number of participants in massive open online courses (MOOCs) and data collected from their interaction with the MOOC open up avenues for studying student engagement at scale. In this work, we...
متن کامل